Entangled qutrits violate local realism stronger than qubits —an analytical proof 
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In Kaszlikowski et al. [Phys. Rev. Lett. 85, 4418 (2000)], it has been shown numerically that 
the violation of local realism for two maximally entangled iV-dimensional (3 < N) quantum 
objects is stronger than for two maximally entangled qubits and grows with iV. In this paper 
f^*) ' we present the analytical proof of this fact for N — 3. 

o ' 

Since the formulation of Bell theorem jjj], various forms of so called Bell inequalities, see for instance 0], have 
been devised to investigate the possibility (or lack of such possibility) of local realistic description of correlations 
observed in various quantum systems such as M entangled N dimensional quantum objects. The main advantage 
of this approach is its simplicity. The drawback of this method is that, in general, Bell inequalities are only a 
necessary condition for the existence of local and realistic description of the investigated quantum system. Only 
in a few cases, see for instance [||-^), it has been proved that some Bell inequalities are necessary and sufficient 
| condition for the existence of local realism. All these cases deal with two [|| or more than two qubits and 

0^ ■ two local observables measured at each side of the Bell experiment. 

In H a general approach to the problem has been presented. It is possible to find all relevant inequalities 
that have to be fulfilled by the probabilities obtained by the measurement of any number of local observables 
on the system consisting of an arbitrary number of quantum objects, each of which described by a Hilbert 
space of arbitrary dimesnion, so that it can be described in terms of local realism. However, the number of 
inequalities that have to be examined grows extremely fast with the dimension of the problem, i.e., number of 
local observables, quantum objects and dimension of Hilbert space describing given objects. This makes the 
Q_i! method practically useless, as shown in MM. 

Recent research shows that a different approach is possible. In |^,|l0| methods of numerical linear optimization 
has been successfully applied to two qubit correlations with up to ten local observables being measured at each 
side of the experiment M a nd for two TV-dimensional objects (2 < N < 16) with two local observables at each 
side of the experiment jRlO]. In this approach one does not find Bell inequalities but finds the conditions 
under which for the given quantum system and quantum observables measured on it there exists a local hidden 
variable model reproducing quantum results. Additionally, this method can be directly applied to the analysis 
. of experimental data. 

The paper fl(i[| is a good example of how important it is to know necessary and sufficient conditions for the 
existence of local realism in the given case. For instance in |l2|, it was shown for two iV-dimensional entangled 
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systems that the Clauser-Hornc-Shimony-Holt (CHSH) inequality 13 is maximally violated by the factor of 
The reason for this is that CHSH inequality is not a sufficient condition for the existence of local realism 
for two entangled objects each described by a Hilbert space of the dimension greater than two. Indeed, the 
results of |[o) show that violations of local realism increase with the dimension of the systems. 

In this paper we prove analytically that the violation of local realism for two maximally entangled qutrits 
(objects described by a three dimensional Hilbert space) observed via two unbiased three input and three output 
beamsplitters [ |l4| is stronger than for two maximally entangled qubits JTof . Earlier numerical computations 
advocated such a violation but rigorous analytical evidence has so far been lacking except for the trivial case 
of qubit. Thus it is anticipated that analytical proofs should exist for higher dimensional quantum systems. 
Our present work on qutrit therefore constitutes the first such attempt to confirm the previous numerical claim. 
Moreover, we also see that the extension from qubit to qutrit is clearly non-trivival. In fact, a comparison of 
our results with the separability condition for so called generalised Werner states |lf| can shed a new light on 
the relation between local realism and separability of bipartite quantum systems. 

We consider the Bell type experiment in which two spatially separated observers Alice and Bob measure two 
non-commuting observables A\,A2 for Alice and B\,B2 for Bob on the maximally entangled state of two 
qutrits 
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|V) = -t=(|0)a|0) b + \1)a\1)b + \2)a\2)b), 



(1) 



where \k)A and \U)b describe k-th basis state of the qutrit A and B respectively. Such a state can be prepared 
with pairs of photons with the aid of parametric down conversion (see in which case kets |/c)a and \k)s 

denotes photons propagating to Alice and Bob in mode k. 

Here we consider the special case in which both observers measure observables defined by 6-port (three input 
and three output ports) beam splitter. The extended theory of such devices can be found in (m). Here we give 
only a brief description. 

Unbiased 6-port beamsplitter, which is called fritter, Ji4[ is a device with the following property: if one photon 
enters into any single input port (out of the 3), its chances of exit are equally split between all 3 output ports. 
One can always build fritter with the distinguishing trait that the elements of its unitary transition matrix, T, 
are solely powers of the 3-rd root of unity a = exp (i27r/3), namely Tu = -^a^ -1 ^'" 1 ) . In front of i-th input 
port of the fritter we put a phase shifter that changes the phase of the incoming photon by 0,. These three 
phase shifts, which we denote for convenience as a "vector" of phase shifts = (0i, 02,03), are macroscopic 
local parameters that can be changed by the observer. Therefore, fritter together with the three phase shifters 
performs the unitary transformation t/(0) with the entries Uu = Tki exp(i0;). 

Alice and Bob measure the following observables 

A^i) ^U($m(0\U(^ +aU($^(l\UHfy + a 2 U($ i )\2)(2\W($ l ) 

B{6 j ) = t>(^)|0)(0|^(^)+ + a#$)|l><l|l^$) + « 2 C/(^)|2)(2|C>+(^), (2) 

where i,j = 1,2 and where, for instance, 0.; denotes the vector of local phase shifts for Alice in the i-th 
experiment. Please notice that we ascribe complex numbers to the results of measurements, i.e., to the "click" 
of the l-th detector we ascribe the number a 1 . The justification of such an assigment can be found in fl4j| . It 
results in a very symmetrical complex correlation function 

E{<iH,e j ) = (ii>\A(4>i)B{d j )\ii)) 

= i(exp(0 l 1 - + 0) - d]) + exp(^ - 0f + 9] - 0f) 

+ exp(0?- 01+03-0))), (3) 

where, for instance, 4>\ denotes the first phase shift at Alice's side in the i-th experiment. This correlation 
function retains the information about the correlations observed in the experiment. In fact, according to 
quantum mechanics the whole information that is accessible in the experiment are probabilities of coincidence 
firings of the detectors. It can be easily verified through the knowledge of the correlation function -E(0i, 0,-) one 
is able to calculate the probabilities of these coincidence "clicks" and in this way obtain the whole information 
about the correlations observed in the system. 

Following jIo| We define the strength of violation of local realism as the minimal noise admixture F t hr to the 
state (0) below which the measured correlations cannot be described by local realism for the given observables. 
Therefore, we assume that Alice and Bob perform their measurements on the following mixed state pF 

p F = (l-F)\ip)(il>\+Fp noise , (4) 

where < F < 1 and where p no ise is a diagonal matrix with entries equal to 1/9. This matrix is a totally 
chaotic mixture (noise), which admits a local and realistic description. For F — (pure maximally entangled 
state) local realistic description does not exist whereas for F = 1 (pure noise) it does. Therefore, there exists 
some threshold value of F, which we denote by F t h r , such that for every F < F t hr local and realistic description 
does not exist. The bigger the value of Fthri the stronger is the violation of local realism. The correlation 
function for the state (f|) reads E F {(jj i ,9 j ) = (1 - F)E{$ l ,9 j ). 

Let us now assume that Alice measures two observables defined by the following sets of phase shifts 0i = 
(0, 7r/3, — 7r/3), 02 = (0,0,0) whereas Bob measures two observables defined by the sets of phase shifts 0i = 
(0, 7r/6, — 7r/6), 02 = (0, — 7r/6, 7t/6). From numerical computations it is known |H|[ll]] that these sets of phases 
gives the highest Fthr- Straightforward calculations give the following values of the correlations functions for each 
experiment: E F (0 X , X ) = E F (0 2 , 2 ) = Q x = ^f±i - E F {$ U 2 ) = Q\, £ F (0 2 , X ) = Q 2 = -|(l + 2»). 

From these complex numbers we can create a 2 x 2 matrix Q with the entries (Q)ij = E (<fii, 6j). 

Local realism implies the following structure of the correlation function that is to reproduce the quantum 
correlation function defined above 



2 



E LH v($i,Oj)= J d\p(X)A($ i ,X)B0 j ,X), (5) 

where for trichotomic measurements A(<j>i, X) — a m and B(0j, X) — a n , (m, n — 1, 2, 3). Three- valued functions 
A((f>i, A), B(9j,X) represent the values of local measurements predetermined by local hidden variables, denoted by 
A, for the specified local settings. This expression is an average over a certain local hidden variable distribution 
p(A) of certain factorisable matrices, namely those with elements given by H l £ = A(4>i, X)B(6j, A). The symbol 
A may hide very many parameters. However, since the only possible values of A(<f>i, A) and B(6j, A) are 1, a, a 2 
there are only 9 different sequences of the values of (A((j>i, A), A((f)2, A)), and 9 different sequences of the values 
of (B(9i, A), B(02, A)), and consequently they form only 81 matrices H\. 

Therefore the structure of local hidden variable model of ELHv{4>i^j) reduces to discrete probabilistic model 
involving the average of all the 81 matrices H\. Therefore, we replace the parameter A by index k (k = 
1,2, ... , 81) to which we ascribe the matrix Hk with entries H^ = a ki+lj = 1, 2), where k\ — [(k — l)/9] — 
1, k 2 = [(k — l)/3] — 1, l\ = 1, li = k (please notice that or 1 = a 2 ) and where [x] denotes the integer part of the 
number x. It can be checked that only first 27 matrices are different, which means that it suffices to consider 
only them. With this notation the correlation function Elhv{^>i, Oj) acquires the following simple form 

27 

iw(M-) = Ep^ ( 6 ) 
k=i 

with, of course, the probabilities satisfying p k > and X^LiPfc = 1- From ELHv{$i,&j) we build the matrix 
Elhv- 

Quantum predictions in form of the matrix Q F can be recovered by local hidden variables if and only if 

27 

Q F = J2PnHn- (7) 

71 = 1 

Now, we want to find the minimal possible F for which it is still possible to recover matrix Q F using the 
probability distribution p n and factorizable matrices H n . For convenience we define a new parameter V = 1 — F. 
Then the minimal F refers to maximal V. 

Theorem: The maximal V equals Vthr — 6v ^~ 9 . 

Proof: First we observe that matrix Q v can be written in the following way 



Q V = V 



2^3 + 1 2-V3 

^ l 7. 



I + V 



2^3-1 2 + V3 
7. h i -„ 



n ■ a, (8) 



where ft = (n Xl n y ,n z ) =^ — |>^>0^ (please notice that \n\ = 1). One observes that Q v commutes with 

the matrix U = n ■ a, which has only two nonzero entries U12 = a 2 ,U2i = a and is unitary and hcrmitian. 
Furthermore, U preserves the structure of matrices H n in the sense that for every n = 1, 2, . . . , 27, UH n U = H m 
for some m = 1,2,..., 27. This is one to one mapping. One can also find that some matrices H n are invariants 
with respect to transformation U. For further considerations it is necessary to have the list of pairs (n, m) 

{(1, 8), (2, 10), (3, 24), (4, 17), (5, 19), (7, 26), (9, 15), (11, 13), (12, 27), (14, 22), (16, 20), (23, 25)}. (9) 

By considering the i-th pair (n,m) in the above list, we can define new matrices, Gi — H n + H m . Thus, 
G\ = H\ + Hg, G2 = H2 + -Hio and so forth. The remaining invariant matrices are Hq, His, H21. For 
convenience, we label them as G13 to G15, viz G13 = Hq,Gu — His, G15 — H2i- 

Suppose that we have the optimal solution (the solution for which V — Vthr ), i-e., we have the probability 
distribution p n so that Q Vthr = Y^J=i PnH n . Acting on both sides of this equation with matrix U we get another 
optimal solution with the same Vthr (matrix U commutes with Q Vthr ) but with the new probability distribution 
p' k , which can be obtained from the previous one by swapping probabilities belonging to the same pair, for 
instance, p' 10 = P2 and so on. Therefore, due to the above property, we can assume without loosing generality 
that in the optimal solution the probabilities referring to the same pair are equal. Therefore, we have reduced 
the number of relevant probabilities from 27 to 15. One can observe that every matrix G^ can be expressed by 
matrices G\, G10, G13 by multiplying them by a, a 2 and —1. For instance, Gq — aG\o, Gn = — G13 etc. Three 
matrices are the same: G 5 = G 3 , G7 = G 4 , Gn = Gg, which further reduces the number of relevant probabilities 
from 15 to 12. 
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Having in mind the above properties we can write the optimal solution in the new form 

gV thr = J2 w k G k , (10) 

fc#5,7,ll 

remembering that now the normalization condition for probabilities w k reads 

2(u>i + W 2 + W 3 + W 4 + W 6 + ^'8 + Wg + W 10 + W 12 ) + W 13 + Wu + W 15 = 1. (11) 

Due to the fact that all G k can be expressed by Gi, Gio, G13, we have 

Q Vthr = (wi + tttog + a 2 w\ 2 )Gi + (u>io + aw e + a 2 w 2 )G w 

+ [(wi 3 - w 9 ) + a(wu - w 3 ) + a 2 (wi 5 - w 4 )]Gi 3 . (12) 
Notice that G\ + Gio — G13 = so that we have 

Q VthT = [{W\ + W13 - Wg) + a(w 8 + - w 3 ) + a 2 (uii 2 + W15 ~ W4)]Gl 

+ [(w 10 + tui 3 - w 9 ) + a{w 6 + wu - w 3 ) + a 2 (w 2 + tuis - w 4 )]Gi . (13) 

Matrix Gi (with entries Gj 1 = 2, G} 2 = —a 2 , G 21 = —a, G 22 = 2) is a sum of matrices Hi,H% whereas matrix 
G10 (with entries G}J = —1, G\q = 2a 2 , GfJ = 2a, G 2 q = —1) is a sum of matrices Hu,H 22 . These four 
matrices are linearly independent so they form a basis in four dimensional space of 2 x 2 complex matrices. The 
expansion of Q Vthr in this basis reads 



Wthr 



A1G1 + A10G10, (14) 



where Ai = V t hr[{\ + + i^f)]' ^10 = v thr[{\ - 5^5 ) + 5775 )]• Because both Ai and A10 lie on the 

complex plane between complex numbers 1 and a, they can be uniquely expressed by these numbers 1 and a with 
positive coefficients, i.e., Ai = V^ hr [^(9 + 2V3) + a i(9-2-\/3)] and A 10 = V" tflr [^(9 - 2^) + a ^(9 + 2^)]. 
We can rewrite the formula (13) using the indentity 1 + a + a 2 = in the following form 

Ai = (wi + W4 + w 13 — w 9 — W12 — W15) + a(w 4 + w s + W14 — w 3 — u; 12 — Wis), 

Aio = («J4 + tUlO + l«13 - ^'2 - U?9 — ^15) + "(^4 + U>6 + M?14 ~ W% — W3 — W15). (15) 



After comparing Eq.(|15|) and Eq.(|lJ) we have 

Wl +W 4 + W13 - Wg- W12 ~ Wi 5 = "^y-(9 + 2\/3), 

w 4 + w s + w 14 , -w 3 - W12 - w 15 = -|^r(9 - 2V3), 
w 4 + w w + W13 - w 2 - wg - w 15 = -^-(9 - 2\/3), 

?« 4 + w 6 + ?«i4 - w 2 - w 3 - w 15 = -|^r(9 + 2-\/3) (16) 

Because we deal with the optimal solution for which V is maximal V — Vthr all the probabilities with negative 
sign in ([l6]) must be zero (please notice that none of the probabilities that come into (|l6|) with negative sign 
appears in any equation with a positive sign). We get 

Wl +w 4 + w 13 = ^1(9 + 2^3), 

w 4 + w 8 + wu = -^r( 9 _ 2 ^)> 

W4, + w w + wis = -^zr(9 - 2-\/3), 

^4 + ^6+^14 = ^(9 + 2^). (17) 

Now the whole probability distribution consists of Wi, W4,, We, wg, Wiq, Wis, Wu. By substracting the fourth 
equation from the second one and the third one from the first one we arrive at 
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4 A/ 4 A/ mj* 

W 6 - W 8 = -— Ktfcr, tOi - 1Uio = -^-Vt/ir. (18) 

Again, because Vt^ r is maximal, it must be w$ = wiq = 0. Thus w\ = wq = ^^-Vthr and the second and the 
third equation in ( |l6| ) become 

+ «;i4 = ^(9 - 2V3), W4 + u»i3 - ^(9 - 2V3), 

(19) 

This clearly implies 1013 = IW14 = q. Normalization condition now reads 

2(u>i + W4 + w 6 ) + W13 + Wu — 1. (20) 



A simple algebra gives 



1 sVS 

2 + ^4 = g ~ -pj-Vthr- (21) 



However, from (|l9|), we know that <? + 104 = -%p(9 — 2\/3)- Therefore 

I-^VW = ^1(9-2V5) (22) 

which gives V{/i r = 6v/ |~ 9 . This ends the proof. 

We have shown analytically that for the Bell experiment with the four trichotomic observables (||) (two at each 
side of the experiment) defined by the sets of phase shifts <j)\ = (0, tt/3, — tt/3), <j>2 — (0, 0, 0), 6\ — (0, tt/6, — tt/6), 
02 = (0, — tt/6, 7r/6) the minimal noise admixture F t hr above which local and realistic description exists is 
Fthr = 1 — Vthr = n ~®^ ■ For two maximally entangled qubits this number is < F t hr- Therefore, two 

entangled qutrits are more robust against local and realistic description than two entangled qubits. 

Although, the presented here proof cannot be easily applied to the set of arbitrary observables defined in (||) 
as it relies on the symmetry properties of matrix Q F it may be considered as the first step towards the Bell 
theorem for two entangled qutrits. 
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